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PCT09 



RAW SEQUENCE LISTING DATE: 02/25/2002 

PATENT APPLICATION : US/09/914 f 870 TIME : 11:45:48 

Input Set : A:\EP.txt 

Output Set: N:\CRF3\02252002\I914870.raw 

Does Not Comply 

6 <110> APPLICANT: Hartmann , Marcus /%*.~~*. i rv i ^ Ti . 

7 vohie, Peter Corrected Diskette Needea 

8 Tiedke, Arno 

9 Baumert, Uwe 

11 <120> TITLE OF INVENTION: a-Hexosaminidase and a DNA Sequence Coding it Obtained 

12 from Ciliates and Use thereof 
14 <130> FILE REFERENCE: 012080US 

16 <140> CURRENT APPLICATION NUMBER: 09/914870 

17 <141> CURRENT FILING DATE: 2000-03-03 

19 <150> PRIOR APPLICATION NUMBER: DE19958979.8 

20 <151> PRIOR FILING DATE: 1999-12-08 

22 <150> PRIOR APPLICATION NUMBER: DE19909189.7 

23 <151> PRIOR FILING DATE: 1999-03-04 
25 <160> NUMBER OF SEQ ID NOS : 3 

27 <170> SOFTWARE: Patentln Ver . 2.1 



ERRORED 


SEQUENCES 
















29 


<210> SEQ ID NO: 1 














30 


<211> LENGTH: 1656 














31 


<212> TYPE 


: DNA 














32 


<213> ORGANISM: Tetrahymena 












34 


<400> SEQUENCE: 1 










E- 


-> 


35 
36 
37 
38 


atgcaaaaga 


tacttttaat 


tactttcctt 


cttggaatag 


ctctcgctca 


aattactcct 


E- 


-> 


60 - 

ggcgttgacc 

i on 


ctatttcagc 


taaggttatg 


cctaaaccta 


agaattacac 


ttatggagat^ j 


E- 


-> 


39 
40 


ttgagcttac 

180 


ttgtcactga 


tccttgcgga 


gtctcttaca 


gaccttctgt 


tgggtcagga 


E- 


-> 


41 

42 


aaagtaccca 

240 


accatgtcta 


tcaaattatt 


ggattctaca 


ctttgaatat 


tttcaattct 


E- 


-> 


43 

44 


aacgaaaact 

300 


cttgtgctat 


gtaaagagaa 


ttgtataaga 


atgaaacaac 


cattgaaaag 


E- 


-> 


45 

46 


atgcgtagat 

360 


tacaacattc 


ctaaaatata 


gtcttcgata 


tttttatcta 


agacgctgct 


E- 


-> 


47 

48 


ttggccactg 

420 


cagacacact 


cgaagacgaa 


tattatgatt 


tataaattta 


taataccaca 


E- 


-> 


49 

50 


tattggaaat 

480 


tgactgctaa 


caaatatgtt 


ggtttactcc 


gtggtttaga 


aacttactct 


E- 


-> 


51 

52 


caattattca 

540 


cttaagacga 


agacactgaa 


gattggtatt 


tgaataacat 


ccctatttct 


E- 


-> 


53 


attcaagatt 


aacctgacta 


catctacaga 


ggtcttatga 


tagattcagc 


cagacatttc 
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PATENT APPLICATION: US/09/914,870 TIME: 11:45:48 



Input Set : A:\EP.txt 

Output Set: N:\CRF3\02252002\I914870.raw 







54 


600 












Li 


_ s 


56 


660 


aaaCUaLtUU 


dadddCLdLL. 


n a *t~ +■ "4- 

ydLUCLdUgt 


LdLLCddCdd 


fr ■4~ />* «a ^ /-t -4— -4~ 

g tug a a ug t u 


17- 


_ % 


D 1 

58 


CLCUa LLyyu 

720 


aCatCaCL^a 


LdCLyddLCC 


-4- -4— /-» « y-i -4— -4- « 

LLCCCCttCC 


f% +■ -4— -4— ^ ^ ^ -|— 

CLCLLdddtC 


"4— +- >-^t _ 4 — ^ *4— 
dLLCCCLddt, 




_ s 

- * 


60 


aLLaCtaaaC 

780 


atgyagcctd 


CtCtddyddg 


dddCddLdCd 


gcuucgaaga 


catXT-aaxiac 


fcj _ 


- ^ 


£ 1 

62 


attgtagact 

840 


aagctctcaa 


caagggtatt 


4- ~ — — .4.+. — 4-J- — 

taagtuat tc 


ctgaagtcga 


ttctccagga 


hi - 


- s 


64 


cacgcttttt 

900 

tataatggat 

960 


catgggctag 


atctccttaa 


ttctctagta 


-4-4-#w+-s<i4- -< 4- -4- 


atgtgattaa 


T-l 

hi - 


s 


D J 

66 


agttagaccc 


aacactaaat 


ttaacttaca 


ctgctgttaa 


gggtattatg 


III - 


- s 


£ 7 
D / 

68 


gaagatatga 

1020 


atacttaatt 


ctacactgct 


aagtatgttc 


attttggtgg 


tgatgaagtt 


TP _ 


_ s 


70 


gaagaa caa l. 
1080 


gc uggaa uaa 


acgccc ugaa 


ax-taaggaat 


tcatgaatta 


aaataacatc 


T? _ 
rj - 




7 1 
/ X 

72 


4— /-i -1— ^ ft ^ -1— ^ *4— ^ 

1140 


ft *4— ^ -|— -4— -1— -I— ^ 

cx-gat Ltg La 


fw ^ ^ -I— -4— ^ -I— -1— ^ ft 

yddLLdLLdC 


agaaagaact 


aagt naaca u 


ttggaaatca 


V - 


_ s 


7 \ 

74 


~1 -4- -4- 4- « n 4* ^ 

1200 


c Ldagcc ugc 




gcaga uucaa 


^ f* "4" -4-- ^» *^ 

a x. a c uxxg a a 


-4— -4— ft ff ft fm 

ai.aT.gy tCCX- 


TP — 




7 R 

76 


/"f ^ -I- f* ^ "I - ^ "I - -I - ^ 

gaLydLaLLa 

1260 


-4- -4-- « ^ -4- 

LLCaaLyy ty 


gg ga tcudct 


CdLgdttttL 


cc.T-caat.caa 


aga tct tcct. 


b - 




7 7 

78 


aacaaaataa 

1320 




ctatgataat 


acttatttgg 


atgttggtga 


gggaaataga 


E- 


- > 


79 

80 


tatggtggaa 

1380 


gttatggcag 


catgtataac 


tgggatgtct 


taaactcttt 


caatcctaga 


E- 


-> 


81 

82 


gttcctggaa 
1440 


ttaagggtga 


aattcttggt 


ggcgaaacat 


gcttatggag 


tgaaatgaat 


E- 


-> 


83 
84 


gatgattcta 

1500 


cttaattcta 


aagactttgg 


acaagaaata 


gtgcatttgc 


tgaaagactt 


E- 


-> 


85 

86 


tggaacactg 

1560 


atgctgctaa 


caatgaaact 


tacaaaacta 


gagctttagt 


tagcagaatg 


E- 


-> 


87 

88 


gtctttatgc 

1620 


aacaccgttt 


aactgctaga 


ggaatccctg 


cttctcctgt 


aacagttggt 


E- 


-> 


89 

90 


atttgtgaat 

1656 


aaaacctttc 


tctctgctac 


aattga 







206 <210> SEQ ID NO: 3 

207 <211> LENGTH: 1837 

208 <212> TYPE: DNA 

209 <213> ORGANISM: Tetrahymena 
211 <400> SEQUENCE: 3 

E--> 212 cagcagtaat aaaaaattct aaatatattg attgtagcta tgcaaaagat acttttaatt 

213 60 

E--> 214 actttccttc ttggaatagc tctcgctcaa attactcctg gcgttgaccc tatttcagct 

215 120 

E--> 216 aaggttatgc ctaaacctaa gaattacact tatggagatt tgagcttact tgtcactgat 

217 180 

E--> 218 ccttgcggag tctcttacag accttctgtt gggtcaggaa aagtacccaa ccatgtctat 
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219 240 

E--> 220 caaattattg gattctacac tttgaatatt ttcaattcta acgaaaactc ttgtgctatg 

221 300 

E--> 222 taaagagaat tgtataagaa tgaaacaacc attgaaaaga tgcgtagatt acaacattcc 

223 360 

E--> 224 taaaatatag tcttcgatat ttttatctaa gacgctgctt tggccactgc agacacactc 

225 420 

E--> 226 gaagacgaat attatgattt ataaatttat aataccacat attggaaatt gactgctaac 

227 480 

E--> 228 aaatatgttg gtttactccg tggtttagaa acttactctc aattattcac ttaagacgaa 

229 540 

E--> 230 gacactgaag attggtattt gaataacatc cctatttcta ttcaagatta acctgactac 

231 600 

E--> 232 atctacagag gtcttatgat agattcagcc agacatttct tatcagttga aactatttta 

233 660 

E--> 234 aaaactattg attctatgtt attcaacaag ttgaatgttc tccattggca catcactgat 

235 720 

E--> 236 actgaatcct tccccttccc tcttaaatca ttccctaata ttactaaata tggagcctac 

237 780 

E--> 238 tctaagaaga aacaatacag cttcgaagac atttaataca ttgtagacta agctctcaac 
239 840 

E--> 24 0 aagggtattt aagttattcc tgaagtcgat tctccaggac acgctttttc atgggctaga 

241 900 

E--> 242 tctccttaat tctctagtat tggtctatta tgtgattaat ataatggata gttagaccca 

243 960 

E--> 244 acactaaatt taacttacac tgctgttaag ggtattatgg aagatatgaa tacttaattc 

245 1020 

E--> 246 tacactgcta agtatgttca ttttggtggt gatgaagttg aagaataatg ctggaataaa 

247 1080 

E--> 248 cgccctgaaa ttaaggaatt catgaattaa aataacatct ctacatatac tgatttgtag 
249 1140 

E--> 250 aattattaca gaaagaacta agttaacatt tggaaatcaa ttaatgctac taagcctgct 

251 1200 

E--> 252 attttctggg cagattcaaa tactttgaaa tatggtcctg atgatattat tcaatggtgg 

253 1260 

E--> 254 ggatctactc atgatttttc ttcaatcaaa gatcttccta acaaaataat tttatctttc 

255 1320 

E--> 2 56 tatgataata cttatttgga tgttggtgag ggaaatagat atggtggaag ttatggcagc 

257 1380 

E--> 258 atgtataact gggatgtctt aaactctttc aatcctagag ttcctggaat taagggtgaa 

259 1440 

E--> 260 attcttggtg gcgaaacatg cttatggagt gaaatgaatg atgattctac ttaattctaa 

261 1500 

E--> 262 agactttgga caagaaatag tgcatttgct gaaagacttt ggaacactga tgctgctaac 

263 1560 

E--> 264 aatgaaactt acaaaactag agctttagtt agcagaatgg tctttatgca acaccgttta 

265 1620 

E--> 266 actgctagag gaatccctgc ttctcctgta acagttggta tttgtgaata aaacctttct 

267 1680 
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E--> 268 ctctgctaca attgattcta aatataaara ttaaataaat attttaagaa atatttttaa /)A^7>^ 

269 1740 /' 
E--> 270 gaatatttta gtataaaaac tgtattttaa ttgataaaaa aaatataaat attattatta 

271 1800 

E--> 272 attgaatttt agctaaaaaa aaaaaaaaaa aaaaaaa 

273 1 837 
E--> 27(^^5 -J) Jljjlxfc*^ 
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VERIFICATION SUMMARY DATE: 02/25/2002 

PATENT APPLICATION: US/09/914 , 870 TIME: 11:45:49 

Input Set : A:\EP.txt 

Output Set: N:\CRF3\02252002\I914870.raw 

L:35 M:254 E: No. of Bases conflict, LENGTH : Input : 0 Counted: 60 SEQ : 1 
M:254 Repeated in SeqNo=l 

L:212 M:254 E: No. of Bases conflict, LENGTH : Input : 0 Counted:60 SEQ:3 
M:254 Repeated in SeqNo=3 
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STATISTICS SUMMARY DATE: 02/25/2002 

PATENT APPLICATION: US/09/914 , 870 TIME: 11:45:49 

Input Set : A:\EP.txt 

Output Set: N:\CRF3\02252002\I914870.raw 

Application Serial Number: US/09/914,870 
Alpha or Numeric: Numeric 
Application Class: 

Application File Date: 03-03-2000 
Art Unit: PCT09 

Software Application: PatentIN2.1 

Total Number of Sequences : 3 

Total Nucleotides: 3493 

Total Amino Acids: 54 9 

Number of Errors: 60 

Number of Warnings : 0 

Number of Corrections: 0 

MESSAGE SUMMARY 

254 E: 60 (No. of Bases conflict) 
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